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METHOD AND APPARATUS FOR HYBRID-TYPE 
HIGH SPEED MOTION ESTIMATION 



BACKGROUND OF THE INVENTION 



Field of the Invention 



The present invention relates to a method and apparatus for hybrid-type high speed 
motion estimation for application to a moving picture encoding apparatus, and more 
particularly, to a method and apparatus for motion estimation combining a one-pixel greedy 
search (OPGS) algorithm and a hierarchical search block matching algorithm (HSBMA). 
2. Description of the Related Art 

In general, international video standards, such as H.261, H.263, Moving Picture 
Experts Group (MPEG)-l, MPEG-2, and MPEG-4, have been adopted for video services, 
entertainment, digital broadcasting, and portable video terminals. A video encoder for 
generating a bit stream according to the international video standards compresses a video 
signal using compression algorithms, such as discrete cosine transform (DCT), quantization, 
and variable length coding, and a motion estimation algorithm. 

Existing motion estimation algorithms include a full-search block matching algorithm 
(FSBMA) and a fast search algorithm (FSA). The FSBMA first obtains differences one by 
one between all possible locations within a search area in the previous frame and 
corresponding locations in the current frame, and then finds a location having the minimum 
error. However, though the FSBMA is the simplest and idealistically accurate, the FSBMA 
requires a huge amount of calculation and therefore is not appropriate for real-time encoding. 

Meanwhile, compared to the FSBMA, the FSA greatly reduces the amount of 
calculation at the cost of less accuracy, and is appropriate for real-time video encoders (for 
example, video telephones, IMT-2000 terminals, video conference systems, etc.), in which 
video quality is relatively less important. Examples of the FSA include a hierarchical search 




block matching algorithm (HSBMA), a three-step search (TSS) algorithm, a 2D logarithmic 
search algorithm (LSBA), and a one-pixel greedy search (OPGS). 

Here, the HSBMA has high accuracy and is relatively less affected by the amount of 
motion, but involves a large number of calculations, and requires a memory for storing low 

5 resolution frames. Also, the HSBMA requires a large number of calculations both for a long 
distance motion vector and a short distance motion vector without distinction. 

The OPGS algorithm can find only an effective motion vector near a central point (or 
a starting point), may incorrectly converge on a local minimum point, may not obtain the 
correct result in a complex image having complex motion, and requires a large number of 

10 calculations to find a motion vector over a long distance. 

Therefore, existing motion estimation algorithms cannot utilize the already calculated 
results and an opportunity to reduce unnecessary calculations, because each of these 
algorithms is applied uniformly to all blocks, regardless of the complexity of motion, and 
regardless of the characteristics of a subject block. Therefore, existing motion estimation 

15 algorithms can be properly implemented in hardware, such as a very large-scale integration 
(VLSI) chip, but it is not efficient to implement the algorithms in a software-dedicated 
encoder. Also, an inexpensive price, low voltage central processing unit (CPU) cannot be 
used to implement the algorithms in software due to the large number of calculations. 



SUMMARY OF THE INVENTION 

To solve the above problems, it is a first object of the present invention to provide a 
motion estimation method in which unnecessary calculations are minimized and accuracy is 
enhanced by performing hybrid-type motion estimation combining a one-pixel greedy search 
(OPGS) algorithm and a hierarchical search block matching algorithm (HSBMA). 
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It is a second object to provide a motion estimation apparatus to which a motion 
estimation method combining the OPGS algorithm and the HSBMA algorithm is applied. 

To accomplish the first object of the present invention, there is provided an adaptive 
motion estimation method, the motion estimation method having the steps of (a) inputting a 
frame in units of macro blocks and a search area and estimating candidate motion vectors for 
a macro block desired to be estimated; and (b) if an error of the candidate motion vectors 
estimated in step (a) is in a threshold range, estimating motion in a restricted search area 
centered on the estimated location, and otherwise, estimating motion in the whole search 
area. 

To accomplish the second object of the present invention, there is also provided an 
adaptive motion estimation apparatus, the motion estimation apparatus having a vector 
estimation unit for receiving video data, and estimating a motion vector matching a macro 
block desired to be estimated by selecting from among a zero motion vector, the previous 
motion vector, and the motion vectors of neighboring blocks as candidate motion vectors; an 
algorithm selecting unit for selecting a motion estimation algorithm by comparing an error of 
the candidate vector estimated in the candidate vector estimation unit with a preset threshold; 
and a motion estimation unit for estimating motion in a restricted search area, centered on the 
estimated location, if an error of the candidate motion vectors estimated in the algorithm 
selecting unit is in a threshold range, and otherwise, estimating motion in the whole search 
area. 

BRIEF DESCRIPTION OF THE DRAWINGS 

The above objects and advantages of the present invention will become more apparent 
by describing in detail a preferred embodiment thereof with reference to the attached 
drawings in which: 



FIG. 1 is a block diagram of an apparatus for hybrid-type high speed motion 
estimation according to the present invention; 

FIG. 2 is a flowchart of a method for hybrid-type high speed motion estimation 
according to the present invention; 

FIG. 3 is a graph showing the distribution of macro blocks by motion vector size 
obtained by performing a full-search block matching algorithm (FSBMA); 

FIG. 4 is a conceptual diagram showing estimation of a candidate vector in the 
previous frame and next frame, according to FIG. 2; 

FIG. 5 is a conceptual diagram for a one-pixel greedy search (OPGS) algorithm 
according to FIG. 2; and 

FIG. 6 is a conceptual diagram for a hierarchical search block matching algorithm 
(HSBMA) according to FIG. 2. 

DETAILED DESCRIPTION OF THE INVENTION 

Hereinafter, embodiments of the present invention will be described in detail with 
reference to the attached drawings. The present invention is not restricted to the following 
embodiments, and many variations are possible within the spirit and scope of the present 
invention. The embodiments of the present invention are provided in order to more 
completely explain the present invention to anyone skilled in the art. 

FIG. 1 is a block diagram of an apparatus for hybrid-type high speed motion 
estimation according to the present invention. 

The apparatus of FIG. 1 has a candidate vector estimation unit 1 10, an algorithm 
selecting unit 120, a motion estimation unit 130, a memory 140, and a half pixel motion 
estimation unit 150. 



Referring to FIG. 1, the candidate vector estimation unit 110 receives video data and 
estimates a candidate vector for a macro block to be estimated at present. At this time, as a 
candidate motion vector, the candidate vector estimation unit 110 selects the best matching 
motion vector among a zero motion vector, a previous motion vector, and the motion vectors 
of neighboring blocks. 

The algorithm selecting unit 120 selects a motion estimation algorithm (the OPGS or 
the HSBMA), by comparing the sum of the absolute difference (SAD) between the candidate 
vector estimated by the candidate vector estimation unit 1 10 and a predetermined threshold. 

The motion estimation unit 130 performs full pixel motion estimation of a macro 
block input by a motion algorithm (the OPGS or the HSBMA) selected by the algorithm 
selecting unit 120. 

A memory 140 stores a full pixel motion estimated value estimated by the motion 
estimation unit 130 and applies the estimated value to the candidate vector estimation unit 
110. 

With video data input, the half pixel motion estimation unit 150 estimates the half 
pixel motion of a 16x16 macro block and 8x8 sub-block, referring to the location of the full 
pixel motion estimated value estimated by the motion estimation unit 130. 

FIG. 2 is a flowchart for a method for hybrid-type high speed motion estimation 
according to the present invention. 

Referring to FIGS. 3 through 6, the flowchart of the method for high speed motion 
estimation shown in FIG. 2 will now be explained. 

First, input video data (or a frame) is divided into macro blocks that are desired to be 
searched, and a search area of the previous or next frame for a macro block is set in step 210. 

Then, a candidate vector for the macro block of which a motion vector is desired to be 
estimated is estimated in step 220. For example, as shown in FIG. 3, the macro block 




distribution by motion size, obtained by performing the FSBMA with video data, shows that 
a good many motion vectors have lengths between "0" and "1". Therefore, to obtain these 
motion vectors, a motion estimation algorithm, which can reduce calculations referring to the 
distribution degree, as shown in FIG. 3, 

5 is applied instead of applying only one motion estimation algorithm to all macro blocks. 

Therefore, the best matching value among (1) a zero motion vector, (2) motion 
vectors of neighboring blocks, and (3) the previous motion vector, for example a vector value 
having the minimum SAD is set as a candidate motion vector. 
Here, three candidate motion vectors are set as follows. 

10 A first candidate motion vector (1) is set to a motion vector having a length of "0" 

(zero motion vector). 

A second candidate motion vector (2) is set to a median value of motion vectors of 
three macro blocks 420, 430, and 440, of which motion vectors have already been calculated, 
centered on the current macro block, as shown in (b) of FIG. 4. However, if the current 

is frame is a bidirectional (B)-type, one among a forward motion vector, a backward motion 
vector, and a bidirectional motion vector in the neighboring macro blocks is selected. 
Therefore, if one of the three direction motion vectors is known, the remaining direction 
motion vectors can be estimated by scaling the motion vector that is already known. For 
example, if a forward motion vector is known, the forward motion vector is appropriately 

20 scaled according to the number of intervals of a reference frame, and by inverting the sign of 
the scaled motion vector, a backward motion vector is estimated. Likewise, a backward 
motion vector can be appropriately changed into a forward motion vector. Therefore, after 
obtaining a median by calculating the three types of motion vectors (forward, backward, and 
bidirectional) for each of the neighboring macro blocks 420, 430, and 440, the motion vector 
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best matching the macro block desired to be estimated is set as the second candidate motion 
vector (2). 

As shown in (a) of FIG. 4, if the current frame is a predictive (P)-type frame, a third 
candidate motion vector (3) is set to a motion vector 410 of a macro block at the same 

5 location in the current frame as the location in the previous P-t; pe frame, or is set to a media' : 
of the motion vectors of the five macro blocks including the 4 neighboring macro blocks. At 
this time, if the current frame is a B-type frame, the motion vector of a macro block at the 
same location of the current frame as the previous or next P-type frame, which is used to 
estimate the motion vector of the current frame, is estimated by scaling. After scaling the 

to four neighboring macro blocks, a median is set as the third candidate motion vector (3). At 
this time, if one directional component of a macro block is known, motion vectors of the 
remaining directions (forward, backward, and bidirectional) can be estimated through scaling 
and inverting the sign, as in the method used to estimate the second candidate motion vector. 
Then, a motion estimation algorithm is selected by comparing the value best matching 

15 the search area among the candidate motion vectors (for example, using the minimum SAD), 
with the predetermined threshold (T), in step 230. Here, if moving pictures are encoded real 
time in a multitasking environment, a shortage of CPU processing power may occur. In this 
case, if a constant frame rate is desired, a target encoding time for each frame is calculated in 
advance. Therefore, a threshold (T) is adjusted by estimating an encoding time for each slice 

20 (a group of a series of macro blocks) for the current frame based on the target encoding time 
calculated in advance. 

Then, if the minimum SAD value corresponding to the value best matching the macro 
block to be estimated, among the candidate motion vectors, is within the range of the 
threshold (T), the OPGS algorithm is selected. At this time, according to the known OPGS 

25 algorithm, a motion vector is searched for in a more limited range of 1/2 or 1/4 of the 



absolute value which a motion vector that is defined by "f code" can be, centered on the 
location corresponding to the estimated candidate motion vector, in step 240. At this time, 
the f code indicates a maximum search range and a minimum search range within which a 
motion vector can be. Referring to FIG. 5, ® is a starting point corresponding to an estimated 
5 locatior , like (a) and (b) of FIG. 4. Centered on the starting point (J)), matching for each of 
four locations indicated by © is performed, and then matching for each of four locations 
indicated by (D or ® is performed, and then matching is performed repeatedly until best 
matching neighboring values do not exist. In this way, a motion vector which is the best 
matching location corresponding to the result (®) is finally converged. 

10 Here, for H.263 and MPEG-4 standards, the OPGS usually performs 8x8 sub-block 

motion estimation (advanced prediction mode or 4MV mode) in a range within ±2 of the 
motion vector of a macro block. However, in the present invention, 16x16 macro block 
OPGS is performed and then OPGS for each 8x8 sub-block is performed in a range within ±2 
of the motion vector. Also, an unrestricted motion vector in an extended area is estimated 

is through repetitive padding as defined in the standards. 

Then, if estimation fails because the best matching value among the candidate motion 
vectors, that is, SAD, is outside of the threshold, the HSBMA algorithm is performed. The 
HSBMA algorithm performs motion estimation for the entire search area indicated by the f 
code. Referring to FIG. 6, the embodiment will now be explained. From a search area 

20 formed of a low resolution or sub-sampled image (for example: [-2, +2]), a motion vector is 
searched for in stage 1. Using the search result of stage 1, a precise motion vector is searched 
for in a restricted search area in a high resolution image or the original image in stage 2. This 
process is continuously repeated until the highest resolution (or the original image) is 
reached, and then the best matching block location is set to the motion vector. 
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Also, HSBMA uses a spiral search, and, if the matching degree is good enough when 
the motion vector which is calculated in each stage (stage 1, and stage 2) and compared to the 
preset threshold, the motion vector is selected as the final estimated value. Here, the 
threshold determines the range of allowable error, determines the accuracy and calculation 
amount of HSBMA, and is selected depending on an estimated encoding time. 

Here, when a motion vector for a 8x8 sub-block of is estimated, HSBMA estimates 
the motion of each of four sub-blocks in stage 2, and estimates the motion of a macro block 
by adding the matching values of the four blocks. Also, when necessary, HSBMA estimates 
an unrestricted motion vector for an extended search area after repetitive padding. 

Then, motion vectors, by pixel unit, estimated by the OPGS algorithm or the HSBMA 
algorithm are stored in units of 16x16 macro blocks, and a motion vector by half pixel is 
estimated on the basis of pixel unit motion estimation in steps 260 and 270. 

Then, motion vectors of 16x16 macro block, 8x8 sub-block, and half pixel are 
extracted in step 280. 

According to the present invention as described above, a motion vector is estimated, 
and then OPGS is performed centered on the estimated location, and if the estimation fails, 
the estimation is compensated by HSBMA to prevent errors due to the inaccurate estimation. 
By doing so, high accuracy can be maintained while the amount of calculation can be 
reduced. The present invention is particularly effective in a real time encoder. 
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